Recovering Nonuniform Planted Partitions via Iterated Projection
نویسنده
چکیده
In the planted partition problem, the n vertices of a random graph are partitioned into k “clusters,” and edges between vertices in the same cluster and different clusters are included with constant probability p and q, respectively (where 0 ≤ q < p ≤ 1). We give an efficient spectral algorithm that recovers the clusters with high probability, provided that the sizes of any two clusters are either very close or separated by ≥ Ω( √ n). We also discuss a generalization of planted partition in which the algorithm’s input is not a random graph, but a random real symmetric matrix with independent above-diagonal entries. Our algorithm is an adaptation of a previous algorithm for the uniform case, i.e., when all clusters are size n/k ≥ Ω( √ n). The original algorithm recovers the clusters one by one via iterated projection: it constructs the orthogonal projection operator onto the dominant kdimensional eigenspace of the random graph’s adjacency matrix, uses it to recover one of the clusters, then deletes it and recurses on the remaining vertices. We show herein that a similar algorithm works in the nonuniform case.
منابع مشابه
Spectral thresholds in the bipartite stochastic block model
1 We consider a bipartite stochastic block model on vertex sets V1 and V2, with planted partitions in each, and ask at what densities efficient algorithms can recover the partition of the smaller vertex set. When |V2| |V1|, multiple thresholds emerge. We first locate a sharp threshold for detection of the partition, in the sense of the results of Mossel et al. (2014, 2013) and Massoulié (2014) ...
متن کاملBayesian Vision for Shape Recovery
We present a new Bayesian vision technique that aims at recovering a shape from two or more noisy observations taken under similar lighting conditions. The shape is parametrized by a pieceviise linear height field, textured by a piecewise linear irradiance field, and we assume Gaussian Markovian priors for both shape vertices and irradiance variables. The observation process, also known as rend...
متن کاملWised Semi-Supervised Cluster Ensemble Selection: A New Framework for Selecting and Combing Multiple Partitions Based on Prior knowledge
The Wisdom of Crowds, an innovative theory described in social science, claims that the aggregate decisions made by a group will often be better than those of its individual members if the four fundamental criteria of this theory are satisfied. This theory used for in clustering problems. Previous researches showed that this theory can significantly increase the stability and performance of...
متن کاملSqueezable Orthogonal Bases: Accuracy and Smoothness
We present a method for generating local orthogonal bases on arbitrary partitions of R from a given local orthogonal shift-invariant basis via what we call a squeeze map. We give necessary and suÆcient conditions for a squeeze map to generate a non-uniform basis that preserves any smoothness and/or accuracy (polynomial reproduction) of the shift-invariant basis. When the shift-invariant basis h...
متن کاملA Simple Spectral Algorithm for Recovering Planted Partitions
In this paper, we consider the planted partition model, in which n = ks vertices of a random graph are partitioned into k “clusters,” each of size s. Edges between vertices in the same cluster and different clusters are included with constant probability p and q, respectively (where 0 ≤ q < p ≤ 1). We give an efficient algorithm that, with high probability, recovers the clustering as long as th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1708.06783 شماره
صفحات -
تاریخ انتشار 2017